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Abstract: To the best of the authors knowledge, there is little structural data avail- 
able on the AGAAAAGA palindrome in the hydrophobic region (113-120) of prion 
proteins due to the unstable, noncrystalline and insoluble nature of the amyloid fibril, 
although many experimental studies have shown that this region has amyloid fibril 
forming properties and plays an important role in prion diseases. In view of this, the 
present study is devoted to address this problem from computational approaches such 
as local optimization steepest descent, conjugate gradient, discrete gradient and New- 
ton methods, global optimization simulated annealing and genetic algorithms, canonical 
dual optimization theory, and structural bioinformatics. The optimal atomic-resolution 
structures of prion AGAAAAGA amyloid fibils reported in this Chapter have a value 
to the scientific community in its drive to find treatments for prion diseases or at least 
be useful for the goals of medicinal chemistry. 

Key words: Amyloid Fibrils; Prion AGAAAAGA Palindrome; Atomic-resolution 
Structures. 

1 INTRODUCTION 

Prion diseases are invariably fatal and highly infectious neurodegenerative diseases that 
affect humans and animals. Prion diseases are amyloid fibril diseases. Prion amyloid fib- 
rils are believed rich in /3-sheet structure and contain a cross-/? core. Many experimental 
works [IIEIEIEIEIEIEIIIIIIIH] show that the hydrophobic region AGAAAAGA of prion 
proteins (113-120) plays an important role in the conversion of amyloid fibrils. PrP 
lacking / deleting the palindrome (PrP 113-120) neither converted to PrP Sc (infectious 
prion) nor generated proteinase K-resistant PrP (2j El HH H3j. Brown et al. [H [3] 
pointed out that the AGAAAAGA peptide was found to be necessary (though not 
sufficient) for blocking the toxicity and amyloidogenicity of PrP 106-126. The peptide 
AGAA did not form fibrils but the peptide AGAAAAGA formed fibrils in both wa- 
ter and PBS pQ. Thus, the minimum sequence necessary for fibril formation should be 
AGAAA, GAAAA, AAAAG, AAAGA, AGAAAA, GAAAAG, AAAAGA, AGAAAAG, 
GAAAAGA or AGAAAAGA. According to Brown [lj, AGAAAAGA is important for 
fibril formation and is an inhibitor of PrP Sc neurotoxicity 
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In theory, for the sake of clarity, we use a program in Zhang et al. [16] to confirm 
that prion AGAAAAGA (113-120) segment has an amyloid fibril forming property. 
The theoretical computation results are shown in Fig. [TJ from which we can see that 
prion AGAAAAGA (113-120) region is clearly identified as the amyloid fibril formation 
region because the energy is less than the amyloid fibril formation threshold energy -26 
[El- 
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Figure 1: Prion AGAAAAGA (113-120) is clearly identified as fibril formation segment. 

However, due to the unstable, noncrystalline and insoluble nature of the amyloid 
fibril, to date structural information on AGAAAAGA region (113-120) has been very 
limited. This region falls just within the N-terminal unstructured region PrP (1-123) 
of prion proteins. Traditional X-ray crystallography and nuclear magnetic resonance 
(NMR) spectroscopy experimental methods cannot be used to get its structural in- 
formation. Under this background, computational approaches or introducing novel 
mathematical formulations and physical concepts into molecular biology can signif- 
icantly stimulate the development of biological and medical science. The author has 
introduced novel mathematical global and local optimization computational approaches 
to produce the atomic-resolution structures of prion (113-120) AGAAAAGA amyloid 
fibrils, which are in Section 2. Section 3 summarizes this article. 
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2 COMPUTATIONAL OPTIMIZATION APPROACHES 
AND THEIR AGAAAAGA STRUCTURAL RESULTS 



In 2007, Sawaya et al. got a breakthrough finding: the atomic structures of all amy- 
loid fibrils revealed steric zippers, with strong vdw interactions (LJ) between /3-sheets 
and hydrogen bonds (HBs) to maintain the /3-strands [12] . In this section, we will 
use suitable templates 20MP.pdb (the LYQLEN peptide derived from human in- 
sulin residues 1318), lYJP.pdb (the GNNQQNY peptide from the yeast prion protein 
Sup35), 3FVA.pdb (NNQNTF 173-178 segment from elk prion protein), 3NHC.pdb 
(GYMLGS segment 127-132 from human prion with M129), 3NVE.pdb (MMHFGN 
segment 138-143 from Syrian Hamster prion), 3NVF.pdb (IIHFGS segment 138-143 
from human prion), 3NVG.pdb (MIHFGN segment 137-142 from mouse prion) and 
3NVH.pdb (MIHFGND segment 137-143 from mouse prion) from the Protein Data 
Bank fl http:// www . r csb . org / 1) to construct some amyloid fibril models for the prion 
AGAAAAGA region (113-120). 

2.1 20MP, 1YJP MODELS 

The author used the unmerge, mutate, and merge modules of Insight II Qhttp://accefrys.com/| ) 
for 20MP.pdb, lYJP.pdb to build the 12-chain AGAAAA 20MP-Model (MODEL01), 
10-chain AGAAAAG lYJP-Model (MODEL02), and 10-chain GAAAAGA lYJP-Model 
(MODEL03) [14J. Then, using AMBER 10 [4J, the MODEL01-03 were refined by the 
hybrid of steepest descent (SD) and conjugate gradient (CG) - simulated annealing 
(SA) - SDCG methods, where SA phase made the MODEL01-03 reach sufficient equi- 
libration and stability. MODEL1 (Fig. [2]) belongs to models of Class 7 (/3-strand 
antiparallel, face=back, upup) of [12], and MODEL02-03 (Fig. [2j) belong to models of 
Class 1 (/3-strand parallel, face-to-face, upup) of [T2] . 

Sawaya et al. [12J proposed eight classes of steric zipper structures for peptide 
segments of fibril forming proteins. For each Class, Zhang p3] constructed the molec- 
ular structures for the hydrophobic region AGAAAAGA palindrome of prion proteins 
(113120). Besides the above successful MODEL01-03, the unsuccessful molecular mod- 
eling experiences are: (1) For Class 1, based on the NNQQNY peptide from yeast 
prion protein Sup35 (lYJO.pdb), a hexamer model with six AAAAGA chains can be 
constructed by SDCG but cannot pass SA; (2) For Class 2 (i.e. /3-strand parallel, 
face-to-back, up-up), a tetramer model with four AGAAAA chains can be constructed 
basing on the SNQNNF peptide of human prion 170175 (20L9.pdb) by SDCG but 
cannot pass SA; (3) For other Classes (yeast Sup35 GNNQQNY form 2, 20MM.pdb; 
yeast Sup35 NNQQ form 1, 20NX.pdb; yeast Sup35 NNQQ form 2, 20LX.pdb; hu- 
man insulin VEALYL, 20MQ.pdb; human Tau protein VQIVYK, 20N9.pdb; human 
A/3 GGVVIA, 20NV.pdb; human A/3 MVGGVV form 1, 20NA.pdb; human A/3 MVG- 
GVV form 2, 20KZ.pdb; bovine RNase SSTSAA, 20NW.pdb) in Sawaya et al. [12], 
the /3-sheet structure of prion AGAAAAGA palindrome cannot pass the SDCG phase. 
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2.2 3FVA MODELS 



Instead of using Insight II, Zhang et al. used the mutate module of the free pack- 
age Swiss-Pdb Viewer (SPDBV Version 4.01) (http://spdb v.vital-it.ch | ) and the hybrid 
discrete gradient (DG) simulated annealing method (i.e. DGSA) to build the prion 
AGAAAAGA amyloid fibril models [15J. The models were built based on the template 
3FVA.pdb of NNQNTF segment from elk prion protein (173-178). After the models 
were built, the refinements were done completely same as [14J. A six chains AGAAAA 
model could not successfully pass SA of Amber 10. However, two prion AGAAAAGA 
palindrome amyloid fibril models (Fig. [3]) - a six chains GAAAAG model (MODEL04) 
and a six chains AAAAGA model (MODEL05) - were successfully obtained. Compared 
with [14J, the variations of RMSD, PRESS, and VOLUME (DENSITY) of the 4,400 
ps' equilibration at 100 K are larger [15]; this might imply that DGSA is a little worse 
than Insight II. 

2.3 3NHC, 3NVE/F/G/H MODELS 

Replacing the DGSA of Subsection 2.2, in this Subsection we will use any optimiza- 
tion solver, which can accurately solve an optimization problem with 3 or 6 variables, 
to build the models; thus the methods in this subsection are very simple and general 
for any problems in molecular modeling research area. The model building templates 
are: 3NHC.pdb (GYMLGS segment 127-132 from human prion with M129), 3NVE.pdb 
(MMHFGN segment 138-143 from Syrian Hamster prion), 3NVF.pdb (IIHFGS segment 
138-143 from human prion), 3NVG.pdb (MIHFGN segment 137-142 from mouse prion), 
and 3NVH.pdb (MIHFGND segment 137-143 from mouse prion). The mathematical 
theory is described as follows. 

The atomic structures of all amyloid fibrils revealed steric zippers, with strong 
vdw interactions (LJ) between /3-sheets and hydrogen bonds (HBs) to maintain the /3- 
strands [12] . In mathematics, the potential energy mathematical formula for the vdw 
interactions between /3-sheets is 

VljW = ± ~ 1 (1) 
and the potential energy mathematical formula for the HBs between the /3-strands is 

WO = % ~ ^, (2) 

where A,B,C,D are constants given. When Vlj and Vhb are reaching their minimal 
values, the amyloid fibril structures should be in a most stable state. This is a molecular 
distance geometry problem (MDGP) [5j, which arises in the interpretation of NMR data 
and in the determination of protein structure [as an example to understand MDGP, the 
problem of locating sensors in telecommunication networks is a DGP. In such a case, the 
positions of some sensors are known (which are called anchors) and some of the distances 
between sensors (which can be anchors or not) are known: the DGP is to locate the 
positions of all the sensors. Here we look sensors as atoms and their telecommunication 
network as a molecule] . The three dimensional structure of a molecule with n atoms can 
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be described by specifying the 3-Dimensional coordinate positions x\, x<i, . . . , x n E M 3 
of all its atoms. Given bond lengths d^ between a subset S of the atom pairs, the 
determination of the molecular structure is 

(Vo) to find xi,X2,...,x n s.t. \\xi - Xj\\ = d^, (ij) E S, (3) 

where 1 1 • 1 1 denotes a norm in a real vector space and it is calculated as the Euclidean 
distance 2-norm in this paper. (|3]) can be reformulated as a mathematical global opti- 
mization problem (GOP) 

(V) minP(X) = E^es^dki " ^l| 2 " 4) 2 (4) 

in the terms of finding the global minimum of the function where Wij, E S 

are positive weights, X = {x\, x*i, • • • , x n ) T E R nx3 [10] and usually S has many fewer 
than n 2 /2 elements due to the error in the theoretical or experimental data fTT} [5]. 
There may even not exist any solution x\, x^ • • • , x n to satisfy the distance constraints 
in (|3]), for example when data for atoms z,j, k E S violate the triangle inequality; in 
this case, we may add a perturbation term — e T X to P(X): 

(V e ) minP e pO = E (i)i)e5 <%(||^ - Xj \\ 2 - 4) 2 - e T X, (5) 

where e > 0. So, the molecular model building problem is a problem to get an global 
minimal solution of (jSj). Specially, for the amyloid fibril molecular modeling problem, 
we find after mutations the hydrogen bonds between /3-strands are still maintained but 
the vdw contacts become very far. Thus, the dij in ([5]) should be the sum of vdw radii 
of atoms i and j. 

After mutations of 3NHC.pdb, 3NVE/F/G/H.pdb by Swiss-Pdb Viewer, we find 
the following least vdw contacts should be maintained respectively for the 3NHC, 
3NVE/F/G/H models. 3NHC: A.ALA3.CB-G.ALA4.CB, B.ALA4.CB-H.ALA3.CB 
(A.ALA3.CB and B.ALA4.CB (two anchors) and G.ALA4.CB and H.ALA3.CB (two 
sensors), MODEL06-08). 3NVE: A.ALA4.CB-G.ALA3.CB, B.ALA4.CB-G.ALA3.CB 
(MODEL09), or A.ALA4.CB-G.ALA3.CB, A.ALA2.CB-G.ALA3.CB (MODEL10-11). 
3NVF: A.GLY2.CA-H.GLY2.CA, A.ALA4.CB-H.GLY2.CA (MODEL12), A.ALA4.CB- 
H.ALA2.CB, A.ALA2.CB-H.ALA2.CB, A.ALA2.CB-H.ALA4.CB (MODEL13-14). 3NVG: 
A.GLY2.CA-H.GLY2.CA, A.GLY2.CA-H.ALA4.CB, A.ALA4.CB-H.GLY2.CA (MODEL15), 
A.ALA2.CB-H.ALA2.CB, A.ALA2.CB-H.ALA4.CB, A.ALA4.CB-H.ALA2.CB (MODEL16- 
17). 3NVH: A.ALA4.CB-H.ALA4.CB (MODEL18-19). We look at the former as an- 
chor (s) and its partner as sensor (s). Thus in (jSJ) the variables are 3 or 6 and its dual 
variables are 1 or 2 or 3. Any optimization solver should be able to accurately solve the 
prion AGAAAAGA amyloid fibril model building problem. Then the MODEL06-19 
will be refined by the SDCG optimization program of Amber 10 and the perfect 3NHC, 
3NVE/F/G/H prion AGAAAAGA amyloid fibril 3NHC, 3NVE/F/G/H models were 
got (Fig.s0EJ. 

3 CONCLUSION 

Whenever traditional X-ray crystallography and NMR spectroscopy experimental meth- 
ods cannot be used to get any structural information of proteins, computational ap- 
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proaches or introducing novel mathematical formulations and physical concepts into 
molecular biology can significantly stimulate the development of biological and medi- 
cal science. The numerous optimal atomic-resolution structures of prion AGAAAAGA 
amyloid fibils reported in Fig.s[2]|8]have a value to the scientific community in its drive 
to find treatments for prion diseases or at least be useful for the goals of medicinal 
chemistry. 
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Figure 2: MODEL01 - MODEL03 for prion (113-120) AGAAAAGA amyloid fibrils. 




Figure 3: MODEL04 - MODEL05 for prion (113-120) AGAAAAGA amyloid fibrils. 




Figure 4: MODEL06 - MODEL08 for prion (113-120) AGAAAAGA amyloid fibrils. 
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Figure 5: MODEL09 - M0DEL11 for prion (113-120) AGAAAAGA amyloid fibrils. 




Figure 6: MODEL12 - MODEL14 for prion (113-120) AGAAAAGA amyloid fibrils. 




Figure 7: MODEL15 - MODEL17 for prion (113-120) AGAAAAGA amyloid fibrils. 
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Figure 8: M0DEL18 - M0DEL19 for prion (113-120) AGAAAAGA amyloid fibrils. 
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